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ABSTRACT 

Aiming to correctly restore the redshifted 21 cm signals emitted by the neutral 
hydrogen during the cosmic reionization processes, we re-examine the separation 
approaches based on the quadratic polynomial fitting technique in frequency 
space to investigate whether they works satisfactorily with complex foreground, 
by quantitatively evaluate the quality of restored 21 cm signals in terms of sam- 
ple statistics. We construct the foreground model to characterize both spatial 
and spectral substructures of the real sky, and use it to simulate the observed 
radio spectra. By comparing between different separation approaches through 
statistical analysis of restored 21 cm spectra and corresponding power spectra, 
as well as their constraints on the mean halo bias b and average ionization frac- 
tion Xe of the reionization processes, at z = 8 and the noise level of 60 mK 
we find that, although the complex foreground can be well approximated with 
quadratic polynomial expansion, a significant part of Mpc-scale components of 
the 21 cm signals (75% for > 6h~^ Mpc scales and 34% for > lh~^ Mpc scales) is 
lost because it tends to be mis-identified as part of the foreground when single- 
narrow-segment separation approach is applied. The best restoration of the 21 
cm signals and the tightest determination of b and be obtained with the 

three-narrow-segment fitting technique as proposed in this paper. Similar results 
can be obtained at other redshifts. 
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1. INTRODUCTION 

In the late stage of the dark ages of the Universe, neutral hydrogen in the intergalactic 
medium (IGM) began to be reionized by UV and soft X-ray photons (Baek et al. 2010) 
emitted by the first-generation sources, such as stars and/or quasars (see Morales & Wyithe 
2010 for a recent review). This period, which started sometime from about 150 Myrs to 500 
Myrs since the Big Bang, is known as the epoch of reionization (EoR), and is one of a few 
most important milestones in the evolution histories of both our Universe and galaxies. The 
study of the line emission of neutral hydrogen at 21 cm (21 cm signals hereafter) coming 
from the surroundings of ionizing sources in EoR provides us with a novel opportunity to 
reveal the properties of the first-generation stars and/or quasars, as well as to constrain 
cosmological models (e.g., Iliev et al. 2002; Furlanetto et al. 2004a; Santos & Cooray 2006; 
Morales & Wyithe 2010). EoR is expected to be a major center of interest within the next 
one or two decades for the facilities working in the low-frequency radio band (< 1 GHz), such 
as the 21 Centimeter Array (21CMaE|), the Low Frequency Array (LOFArI), the Murchison 
Wide-field Array (mwaI), and the Square Kilometer Array (SKAo). 

One of the serious challenges in observing the 21 cm signals from EoR is that they are 
extremely weak (~ 10 mK). Having been redshifted to the low-frequency radio band (50—200 
MHz, for 6 < z < 27; Morales & Wyithe 2010; Field 1959a, 1959b; Scott & Rees 1990; Madau 
et al. 1997), the 21 cm signals are drowned in the luminous foreground, which consists of 
our Galaxy, extragalactic discrete sources (e.g., radio galaxies and active galactic nuclei, i.e., 
AGNs), galaxy clusters, and other relatively minor contaminating sources. Theoretical and 
numerical efforts have been made in the past few years to probe how to correctly restore the 
redshifted 21 cm signals from this overwhelmingly luminous foreground, e.g., extraction of 
the 21 cm signals in either the angular power spectrum space (e.g., Di Matteo et al. 2002, 
2004; Santos et al. 2005; McQuinn et al. 2006), or the frequency space (e.g., Wang et al. 
2006; Gleser et al. 2008; Jehc et al. 2008; Bowman et al. 2009; Harker et al. 2009a; Liu et 
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al. 2009; Petrovic & Oh 2011), both on the a priori assumption that the integral foreground 
emission can be well described as a smooth continuum. 

For the separation works in the frequency space, the total spectrum is usually fitted 
with a smooth function (e.g., an n-order polynomial model logT^'^j.^^u) = a() + J2f=i^ ^t? log 
where i = I, 2, A'poiy) to describe the integral foreground brightness temperature Tforc(z^)- 
However, the first thing one shall keep in mind is that the foreground models adopted by pre- 
vious works are relatively simple more or less, e.g., extragalactic discrete sources are treated 
as point sources, showing no substructure such as lobes and jets that are often observed in 
nearby FRI and FRII radio galaxies, and/or possessing simple, featureless power-law radio 
spectra. Meanwhile, galaxy clusters, which contribute to the foreground at a non-negligible 
level compared with AGNs, are not taken into account, or treated as simple power-law spec- 
trum sources. A revisit to the separation approach with more complex and more realistic 
foreground models is necessary. Secondly, a variety of theoretical and numerical works (e.g., 
Furlanetto et al. 2004b; Wyithe & Loeb 2004; Furlanetto & Oh 2005; Petrovic & Oh 2011) 
have already indicated that in late phases of the cosmic reionization, the typical sizes of 
ionized regions are of the order of a few tens of comoving Mpc, so that the 21 cm signals 
possess slowly-varying components at ~ 1 MHz scales in frequency space due to a specific 
distribution of HII bubble sizes. For example, Mesinger & Furlanetto (2007) predicted that 
at 2; = 8 the diameter distribution of the HII bubbles is peaked at D^^ic\z=s — 16 Mpc (cor- 
responding to a frequency span of Au^'^^^^ ^ 0.9 MHz), along with a significant tail out to 
-^bXbiel^=8 — 60 Mpc {A.i'^^^]^^e — 3-4 MHz). Apparently, the non- negligible slowly- varying 
21 cm components contributed by these giant bubbles should have been entangled with the 
foreground in frequency space, and most of them are bound to be abandoned in foreground 
subtraction, especially when a single-narrow-frequency segment (e.g., Az/gegmcnt — 2 MHz as 
used in Wang et al. 2006) is adopted in the quadratic polynomial fittings. In fact, there is 
a significant power loss on Mpc scales as will be shown in §3.2.1. 

In order to address the problems raised above, we attempt to re-examine and improve 
the polynomial fitting algorithm by applying a more complex foreground emission model 
that contains detailed emulations of both spatial and spectral features of the real sources 
(see Wang et al. 2010 for details, which followed Snellen et al. (2000), Giardino et al. (2002), 
Finkbeiner (2003), Wilman et al. (2008), and references therein; W2010 hereafter), mean- 
while employing a multi-narrow-frequency segment quadratic polynomial fitting technique 
to remove the foreground emission components in frequency space. We find that our new al- 
gorithm can reasonably correct the systematic bias introduced by the single-narrow-segment 
algorithm, which may lead to a reduction of the 21 cm signal power by about 75% for > 6h~^ 
Mpc scales and 34% for > lh~^ Mpc scales, and therefore a significant underestimate of both 
the HII bubble size and growth rate. We also prove that at z = 0.5 our algorithm is efficient 
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and reliable in restoring mass distribution on > 30h~^ Mpc scales (by contrast about 19% 
power will be lost with the single-segment approach), thus it can be applied to extract the 21 
cm signals emitted at intermediate redshifts to investigate the baryon acoustic oscillations 
(BAO), which exhibit a typical scale of ~ 150 comoving Mpc (Morales & Wyithe 2010; 
Ansari et al. 2012). 

In parallel to the polynomial fitting technique, recently a series of works have been 
published in an attempt to develop non-parametric techniques and apply them in observing 
the 21 cm signals, which do not assume a specific form for the contaminating foregrounds. 
Using Wp smoothing method, Harker et al. (2009b, 2010) preferentially consider foreground 
models with as few inflection points as possible, which when apply to simulated LOFAR-EoR 
data compare very favorably with parametric methods. FastICA as presented in Chapman 
et al. (2012) accurately recovers the 21 cm power spectra by considering the statistically 
independent components of the foregrounds. With Generalized Morphological Component 
Analysis (GMCA), Chapman et al. (2012b) not only recovers the power spectra to high 
accuracy but also recovers the simulated 21 cm signal maps exceedingly well using wavelet 
decomposition. More detailed discussion on the non-parametric techniques can be found in 
the works listed above, and will not be presented here because it is beyond the scope of this 
paper. 

Throughout the work we adopt h = 0.71, f^M = 0.27, f^A = 0.73, and Qb = 0.044. 
Unless otherwise stated, all errors are quoted at 68% confidence level. 

2. SIMULATION OF LOW-FREQUENCY RADIO SPECTRA 

2.1. 21 cm Signals of Neutral Hydrogen from EoR 

According to Wang et al. (2006) and references therein, the three-dimensional (3D) 
power spectrum of the 21 cm signals from EoR at redshift z shall take the form of 

P3B,2iUk,z) = (0.016 Kr^[^y^-^^{[l-x^{z)f + b\z)e-'''^'^^^xl{^^^ 

(1) 

where Psd, matter {k, z) is the 3D matter power spectrum at redshift z, Xc{z) is the average 
ionization fraction, b{z) is the mean halo bias (the mean ratio of the mass over-densities of 
halo to field weighted by different halos), and R{z) = 100 [1 — Xe{z)]~^^^ kpc is the mean 
radius of the ionized patches in HIT regions. To calculate -P3D,2icm(^7 -z), we determine b{z) 
and Xc{z) by adopting the parameters used in the fiducial reionization model of Santos et 
al. (2003, 2005). At any given redshift Zq, we consider a redshift segment defined between 
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zo and zq + A^scgmcnt (corresponding to reference frequency z/q by z/q — Az/scgmcnt < ^ ^ 
since u = 1420.4/(1 + z) MHz for the redshifted 21 cm signals), where Azgcgmcnt <^ 1 so that 
-P3D,2icm(^5 ^o) is roughly uniform and isotropic within the segment A^rgegment or Az^segment 
(i.e., the impact of evolution effects can be ignored across Azsegment)- In this narrow redshift 
range we are allowed to convert the derived -P3D,2icm(^, ^o) into the one-dimensional (ID) 
power spectrum using the integration formula 

r+oo 

PlD,21cm(A:) = / P3B,2lUk',Zo)k'dk' (2) 

Jk 

(Peacock 1999). The ID power spectra are shown in Figure 1 for some typical redshifts. 
The spectrum (i.e., line-of-sight distribution) of the redshifted 21 cm signals in the segment 
uq — Ai^scgmcnt < z^ < t'o is then calculated through the inverse Fourier transform 

v^2 — 2 2 

T21cm(a;n) = — ^ [^g(^'lD,21cm(fc, ^^o)) COS j + 5^ (PlD,21cm (fc, ^^o)) slu (^^n) , 

q=0 

(3) 

where L is the comoving space scale corresponding to Azgegment, terms Ag{Pii) 2icra{k, zq)) 
and -Bg(-PiD,2icm(^, ^o)) are independent and random parameters sharing the same Gaussian 
distribution A/'(0, v^-PiD,2icm(^5 Zo)/2), k = 2nq/L is the wave number, x„ {n = 1, . . . , A^) 
is the line-of-sight location related to frequency Vn under linear approximation in frequency 
space, and is the number of frequency channels for the segment chosen large enough so 
that most information of Pm,2icm{k, z) is included within the range 2tt/L < k < 2ttN/L. 
We adopt a segment width of Az/segment = 2 MHz and = 50, therefore the frequency 
resolution (i.e., the width of each frequency channel) is dp = Az/scgmcnt/-^ = 40 kHz, which 
is a sufficiently high frequency resolution in order to reconstruct the local power spectrum 
for the give A^segment at z. 



2.2. Foreground Emission Components and Instrumental Noise 

2.2.1. Foreground Components Simulation 

The foreground emission components that contaminate the 21 cm signals have been 
continuously discussed and simulated in detail in many works (e.g., Jelic et al. 2008; Gleser 
et al. 2008; Bowman et al. 2009; Jelic et al. 2010). In our previous work (W2010), we 
fully took into account the effects of random variations of model parameters in the ranges 
allowed by the observations in Monte-Carlo simulations to model the emissions from our 
Galaxy, galaxy clusters, and discrete sources (i.e., star-forming galaxies, radio-quiet AGNs, 
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and radio- loud AGNs), and constructed the 50 — 200 MHz radio sky maps with a frequency 
resolution of 40 kHz, following the works of Snellen et al. (2000), Giardino et al. (2002), 
Finkbeiner (2003), Wilman et al. (2008), and reference therein. All the simulated sky 
maps are plotted in a circular field of view (FOV) with a radius of 5° centered at the north 
celestial pole, as currently being observed with the 21CMA array, and the obtained images 
are pixelated with grids of 1024 x 1024 pixels, each image pixel covering approximately a 
0.6' X 0.6' patch. We adopt the same model to create the contaminating foreground emission 
components used in this paper. To compare with Wang et al. (2006), the size of each sky 
pixel is fixed to 12 arcmin^, which is no smaller than the area covered by the point spread 
function of any operating or upcoming facility designed to detect EoR signals. We note that 
the foreground simulations in Wang et al. (2006) did not include discrete sources brighter 
than 0.1 mJy at 150 MHz, because the authors considered that all of them could be identified 
and removed safely before the step of signal separation. In our work, however, we assume a 
conservative flux cut of 10 mJy at 150 MHz, since technically it is more realistic with current 
instrumentations (Liu et al. 2009; Pindor et al. 2011). The brightness temperatures of the 
simulated foreground at 150 MHz are listed in Table 1. 



2.2.2. Instrument and Noise 

The 21CMA array is a low-frequency radio interferometer array constructed in a remote 
area of Xinjiang, China. The array consists of 81 telescopes that are distributed in a "T" 
shape with the north-south and east-west baselines being 6 km and 4 km, respectively 
(Figure 2), and each telescope is an assembly of 127 logarithmic periodic antennas. The 
21CMA array is designed to continuously observe the same patch of the sky around NCP 
(i.e., 24 hrs a day). The integration uv coverage of 24 hrs is shown in Figure 3. The thermal 
noise of 21CMA can be expressed in terms of brightness temperature as 

T X2 

-'■sys'* /.N 

C"noise ~ ~~ / = 

Aes^l^y n[n — Ijrdu 

(e.g., Thompson et al. 2001; W2010), where Tgys is the system temperature, A is the wave 
length, Acs is the effective area of single radio telescope, Q is the beam solid angle, r is the 
effective integral time, and n is the number of single radio telescopes. For 21CMA, we have 
Tsys = 300 K, Aeff = 218 m^, and n = 81 (Table 2), thus 

A V /I MHzV^^ /30 daysV^^ fl x 10"^ sr^ 



a 



noise 



Supposing a typical 21CMA survey of du = 40 kHz, r = 1 yr, and f2 = 1 x 10 ^ sr (i.e., 12 
arcmin^), the noise is criioise|y=i57gMHz = 60 mK at A = 1.9 m (corresponding to z/ = 157.8 
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MHz) where the 21 cm signals emitted at z = 8 are expected to appear. Under the same 
observation conditions the noise at any frequency is given as o"noisc(i^) = 60 niK ( 157 s'^mhz )"^" 
Following McQuinn et al. (2006), Chapman et al. (2012), and Zaroubi et al. (2012), the noise 
at each frequency v is simulated as a Gaussian random field in the 21CMA uv plane, and 
then transformed to the image plane to obtain the noise Tnoise(^), whose root-mean-square 
(rms hereafter) value is normalized to o^oi^ciy)- 



3. SIGNAL SEPARATION 

3.1. Single-Segment Fits to Pure Complex Foreground 

To extract the foreground components, n-order polynomial models are usually applied 
in the frequency space, e.g., Wang et al. (2006) performed a logarithmic form 

log Tl^^Jyv) = ao + ai log z/ 02 log^ v (6) 

to approximate the integral foreground in a single segment with a width of Az/gegmcnt — 2 
MHz for = 8, where (z = 0, 1, 2) are the parameters all obtained by applying the least 
squares fitting method. To determine if this approximation is appropriate when complex 
foreground is introduced, and if it can be applied to a wider frequency segment of a width 
~ 10 MHz as will be discussed in §3.2 and §4.1, we fit the pure simulated integral foreground 
in the logarithmic frequency space using linear polynomial {\a^Tl^^Jyy) = oq + ailogz/), 
quadratic polynomial (Eq. (6)), and cubic polynomial {Vo^Tl^^Jyv) = ao + ai log i/-|-a2 log^ z/-|- 
aalog^i^), respectively. Each of the three polynomial fittings is applied to 1000 randomly 
selected sky pixels on the simulated foreground map with different segment widths (i.e., 
Ai/scgmont = 2i MHz for zq = 8, where i = 1, 2, ■ ■ ■ , 40) with central frequency located at 
z/q = 1420.4 MHz/(l -|- zq). As shown in Figure 4, the intensity residuals of linear polynomial 
fittings can reach up to ~ 10 mK in many cases, which is comparable with the intensity of 
the 21 cm signals, whereas the residuals of quadratic polynomial fittings (< 1 mK) are small 
enough to be negligible compared to the intensities of both the instrumental noises and the 
21 cm signals. Since the quadratic polynomial fitting is good enough to approximate the 
complex foreground for a large range of Az/scgmcnt, we decide that the cubic one with even 
smaller residuals is unnecessary in our work. 

In Figure 4, it is also shown that the residuals, although statistically small enough to 
make the quadratic polynomial fittings acceptable when Ai/segment < 100 MHz, increase as 
the segment broadens. We ascribe this to the fact that our foreground model (for details 
see §2.2.1 and W2010) is complex, thus a broader segment means that more complicated 
behaviors of the foreground have been involved to cause larger uncertainties. We note that 
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the clustering of the extragalactic discrete sources and galaxy clusters, which has been taken 
into account in our sky simulation, does not lead to significant residuals to overwhelm the 21 
cm signals in the separation process. Apparently, there may exist other ways to exclude the 
complex foreground, e.g., the non-parametric technique (e.g., Harker et al. 2009b; Chapman 
et al. 2012), but it is beyond the scope of this paper to test and discuss them. 

3.2. Separation at Noise Level of 60 mK 

The ideal low noise level as applied in some foregoing works (e.g., Wang et al. 2006) 
might not be achieved until the next generation facilities have been built. For currently 
operating facilities, such as 21CMA and LOFAR, the noise will dominate the 21 cm signals 
from the EoR, so that direct restoration of the 21 cm signals is precluded (e.g., McQuinn 
et al. 2006; Gleser et al. 2008; Bowman et al. 2009; Harker et al. 2009b; Chapman et 
al. 2012). To study this case, here we assume a noise level of o"noise|2=8 = 60 mK, which is 
achievable with a one-year observation of 21CMA (§2.2.2), and investigate how to restore 
the 21 cm signals. 

3.2.1. Separation in Single- Narrow- Frequency Segment 

As a first step, we apply the separation approach on a single segment with a narrow 
width of Az/gegment = 2 MHz (the channel width du = Ai/gcgment/A^ = 2 MHz/50 = 40 kHz), 
to examine if the information on the fiuctuations of the 21 cm signals on Mpc scales is lost 
when our complex foreground model is applied. The segment width of 2 MHz is adopted 
to guarantee that the power spectrum of the 21 cm signals is roughly uniform within the 
corresponding redshift segment (A^scgmcnt — 0.1) and -PiD,2icm(^) can be measured accurately. 

We fit the total (foreground + noise + 21 cm signals) brightness-temperature spec- 
trum Ttot{j^) with Equation (6) to determine the best-fit foreground T/^j.^^u), and derive the 
residuals (raw spectrum hereafter) T|f^(i/) = Ttot(i^) — T/^rd^)' which contain the 21 cm 
signals and a dominating noise component (i.e., T|f^(i/) = T2ic^{i') + ^noise('^))- Then, we 
transform the raw spectrum into the corresponding power spectrum. By subtracting the 
theoretically predicted noise component from the averaged power spectrum for a sufficiently 
large number of sky pixels, we may obtain a mean restored power spectrum for the 21 cm 
signals (e.g., Harker et al. 2010; Chapman et al. 2012). When comparing the results with 
the input ID power spectra created in the simulations (§2.1; Fig. 1), we find that, when 
10000 sky pixels (12 arcmin^ for each) that cover about 1/3 FOV of the 21CMA (about 33 
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deg^) are in use, the power spectrum of the 21 cm signals can be well recovered on small 
scales of < 3h~^ Mpc, whereas on larger scales the deviation is huge (reduced = 5196; 
Figure 5a). We measure the relative Mpc-scale power restoration in terms of the ratio 

of restored to simulated powers as w\k<ih/Mpc = ^^\k<ih/Mpc = f^^^''^p^l["lfk)7k ' ^^^^^ 

-P2icm|A;<ih./Mpc = j^^ j PiD ,2icia{k)dk is the Mpc-scale power of the input 21 cm signals. We 
find that the mean value is w\k<ih/M-pc = 0.25±0.04 for the total 10000 cases, which indicates 
that the use of single-narrow-segment quadratic polynomial fitting technique may cause an 
average power loss on Mpc scales (i.e., tyioss|fc<ih/Mpc = 1 — w;|fc<i/i/Mpc) of about 75% in the 
redshifted 21 cm signals. Since the existence of such Mpc-scale structures is expected to be 
quite common in late phases of EoR (e.g., Furlanetto et al. 2004b; Petrovic & Oh 2011), 
part of the 21 cm signals is bound to be abandoned along with the foreground if a separation 
algorithm based on single-narrow-segment quadratic polynomial fittings is applied. 

We exclude the possibility that such a Mpc-scale bias is caused by the complex fore- 
ground employed in this work, simply by replacing our foreground with that of Wang et 
al. (2006) 's and obtaining similar results. An enlargement of Ai^segmcnt to the order of 10 
MHz has been suggested in some previous work (e.g., Harker et al. 2009b; Petrovic & Oh 
2011) to mitigate the Mpc-scale bias. However, as will be discussed in §3.2.3 and §4, we do 
not suggest this method because it will introduce too many auxiliary frequency channels in 
calculation, a relatively larger bias of w|fc<ih/Mpc, and a relatively poorer determination of 
the physical conditions of EoR when compared with the three-narrow-segment separation 
approach. 



3.2.2. Separation in Three- Narrow- Frequency Segment 

In the following separations, we attempt to introduce more segments and frequency 
channels into the spectral fittings as auxiliaries to determine the foreground more accurately. 
Inspired by the fact that the complex foreground can be well approximated with quadratic 
polynomial expansion in frequency ranges much wider than the largest scales of the 21 
cm signals (§3.1), we propose to apply a multi-narrow-segment (hereafter multi-segment) 
quadratic polynomial fitting technique as an attempt to solve the problems raised in §3.2.1. 
The frequency resolution and the number of channels in each segment are dv = 40 kHz and 
= 50, respectively, and the span of the segments is of the order of 10 MHz. We carry out 
the foreground fittings by using the same quadratic polynomial as formulated in Equation 
(6) to approximate the low-frequency foreground simultaneously in three narrow segments, 
which are separated far away enough to guarantee that the 21 cm signals in different segments 
come from different ionized bubbles. Taking the redshift zq = 8 (i.e., reference frequency 
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z/q = 1420.4 MHz/(l + 2;o) = 157.8 MHz) as an example, we define the three segments in such 
a way that each of them possesses the same bandwidth of Ai^scgmcnt = 2 MHz as in §3.2.1, 
and they are separated by a gap measured either in redshift (i.e., a segment combination of 
^^segment = {^0; ^i, Z2} . Here zq is the reference point for segment one, and zi = zq + Azgap 
and Z2 = Zq — Azgiip for segment two and three, where A^gap = 0.5, 1.0, 1.5, respectively) 
or in frequency (i.e., a segment combination of i^segmcnt = {^o, i^i, ^2}- Here is the 
reference point for segment one, and = uq + Ai/gap and 1^2 = ^0 — Ai/gap for segment 
two and three, where Az/gap = 10, 20, 30 MHz, respectively). We use the same method as 
described in §3.2.1 to restore the 21 cm signals, and evaluate the quality of the results in 
1^0—2 MHz < u < 1^0 using the reduced ■ In general the results are significantly better than 
those of the single-narrow-segment and there exists a very weak dependence of separation 
quality on the segment gap size. Taking the segment combination of ^rgegment = {8, 9, 7} for 
example, the power spectrum of the 21 cm signals can be recovered very well with the three- 
narrow-segment approach (reduced = 1-137 and w\k<ih/Mpc = 1-00 ± 0.05; Figure 5c), 
which is significantly better than that obtained with the single-narrow-segment separation 
(x^ = 5196 and w\k<ih/Mpc = 0.25 ± 0.04), especially on large scales. It is found that the use 
of more than three segments is redundant. It does not help improve the separation quality 
but lower the efficiency of calculation. 



3.2.3. Single- Segment Separations with a Larger Ai/geg^ent 

In order to reduce the systematic bias introduced by possible mis-subtraction of the Mpc- 
scale components in redshifted 21 cm signals in single-narrow-segment quadratic polynomial 
fittings, Petrovic & Oh (2011) qualitatively mentioned that a larger segment width should 
be adopted, especially in restoring the 21 cm signals in late reionization stages when the 
HII bubble size becomes sufficiently large. The similar problem was also briefly mentioned 
by Wang et al. (2006), who suggested an enlargement of Az/sogmcnt by a factor of ten. To 
investigate if this is feasible we repeat the single-segment separation calculations described 
in §3.2.1 for another 10000 rounds by increasing the segment width to Ai^scgmcnt = 20 MHz 
(A^ = Ai/scgment/c^i^ = 500) and examine the changes in the results. The goodness of fittings 
is much better than that of fittings with single-narrow-segment fittings (Table 3), which 
supports the ideas of Petrovic & Oh (2011) and Wang et al. (2006). However, we note that 
in single-wide-segment fittings the goodness of both the reduced = 1-365 and w|fc<i/i/Mpc = 
0.97±0.05 (Figure 5) are a bit worse when compared with those of our three-narrow-segment 
fittings (reduced x"^ = 1-137 and w\k<ih/Mpc = 1-00 ± 0.05). In addition, as will be shown in 
§4.2, single-wide-segment quadratic polynomial separation technique cannot yield the best 
constraints on the physical conditions of EoR. 
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We have also tried to enlarge du from 40 kHz to 100 kHz, which would reduce the 
required observing time to achieve crnoisc|!/=i57.8MHz = 60 mK. The results are shown in Figure 
5(d)-(f), in which we can find our three-narrow-segment method still works well, whereas 
the information of smaller scales would be lost due to the sampling reduction of frequency. 



4. DISCUSSION 

4.1. Further Comparison between Different Approaches 

The differences between the results obtained with single- segment and multi-segment 
separation approaches can be explained as follows. In the late stage of cosmic reionization 
(since z < 9 depending on models; e.g.. Morales & Wyithe 2010 for a recent review; Santos et 
al. 2008) most of the HII bubbles have been well developed. For example, by simulating the 
reionization process using the Lagrangian perturbation theory, Mesinger & Furlanetto (2007) 
predicted that at 2; = 8 the bubble size -Dbubbic (corresponding to a frequency span of Ai/bubbic) 
should range from I^^j^bieU=8 ^ 1 Mpc (Ai/-j^biel.=8 ^ 0.05 MHz) to I?SbicU=8 ^ 60 
Mpc (Az/S,iel.=8 - 3.4 MHz) with a clear peak at D^^.J^^s - 16 Mpc {AuZ^,J,=8 - 
0.9 MHz) in number distribution. This means that the 21 cm signals can possess slowly- 
varying components in frequency space on > 1 MHz scales. In single-narrow-segment (e.g., 
Ai/segment = 2 MHz) Separations, the HII bubble size Az/bubbic is comparable to or even 
larger than the chosen Az/segment- This makes it inevitable that a significant part of the 
Mpc-scale 21 cm components is treated as a part of foreground, which is smooth on ~ 100 
MHz scales, and then mis-subtracted (Fig. 6). In single-segment separations with a larger 
Ai^scgment, the Segment width increases to Ai^scgmcnt ^ Az/bubWc, the contributions of 
more than one HII bubbles located along line of sight are involved in the spectra to be 
fitted. Since the bubble and foreground components vary on quite different scales within 
the segment, the problem of mis-subtraction can be significantly mitigated, except that the 
power restorations for Mpc-scale components are not fully constrained (Table 3), mostly 
due to the appearance of giant bubbles that contribute smooth components on tens of Mpc 
scales. In three-narrow-segment separation fittings with narrow Ai^segmcnt, the Mpc-scale 21 
cm components in different segments come from different bubbles, i.e., they do not correlate 
with each other whereas the foreground components in the three segments do. Therefore the 
foreground and 21 cm signals are best determined, respectively. 

This can also be clearly seen when the noise is sufficiently low (e.g., crnoisc|2=8 = 1 mK), 
although the realistic condition (e.g., crnoise|2=8 = 60 mK) follows the same rule. Taking 
c"noise|2=8 = 1 as an example, we repeat the separation using single-narrow-segment, 
three-narrow-segment, and single-wide-segment separation approaches with 1000 rounds, re- 
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spectively. We evaluate the result of each round in a quantitative way by calculating the rela- 



tive root-mean-square error (RRMSE hereafter) as follows, rgt = — ^^^^2\''\v)\ ""^ • 

With the single-narrow-segment approach, only 25% of the 1000 cases can yield accept- 
able (rgt < 0.1; Figure 7 and see an example in Figure 8) results. We find that failed 
(rgt > 0.1; Figure 7 and see an example in Figure 9) restorations tend to occur when the 
21 cm signals possess significant large-scale {k < \h Mpc~^) components along line of sight, 
which contribute a non-negligible part to the simulated 21 cm spectra (Figure 10). The 
best restoration of the 21 cm signals can be obtained with the three-narrow-segment fitting 
technique (Figures 7 and 9; Table 4), and in single- wide-segment fittings the scatter of power 
restoration on > 6/i~^ Mpc scales is larger when compared with our three-narrow-segment 
fittings (Table 4). We also show the result of the two-narrow-segment approach, which is 
similar to three-narrow-segment approach but has one less segment. In general the results 
of two- narrow- segment approach are better than those of the single-narrow-segment separa- 
tions, but the goodness of separation is more or less sensitive to the choice of A^gap or Az/gap 



We also have applied above separation approaches to higher redshifts of =9, 10, 12, 
and 14, which correspond to reference frequencies of z/q =142.0 MHz, 129.1 MHz, 109.3 MHz, 
and 94.7 MHz for redshifted 21 cm signals, respectively (Table 5). At these redshifts, where 
the cosmic reionization model adopted in this work (Santos et al. 2003, 2005; §2.1) predicts 
that the HII bubbles are growing in their earlier stage and less extended, the separation 
quality of our three-narrow-segment approach is still the best, although statistically the 
separation quality of the single-wide-segment approach is almost equally good. 



In fiducial reionization models such as the one adopted in this work (Eq. (1)), the 
power of the redshifted 21 cm signals is mainly determined by the mean halo bias h and the 
average ionization fraction Xe at a given redshift (Fig. 11), whereas nowadays both h and Xe 
are poorly constrained and suffer large uncertainties among different theoretical reionization 
models (Santos et al. 2003). In future observations, well restored 21 cm powers will greatly 
help derive h and Xg, and then deduce HII bubble size and growth rate. Since powers 
integrated in different scale ranges follow different contour lines on the Xe — b diagram (e.g., 
w^|fc<i/i/Mpc and w\k<5h/Mpc on Fig. 11), it is feasible to constrain b and Xe simultaneously 
using the overlapping area of any two of these contours (Fig. 12), the accuracy of which 
depends on how accurate we can restore the corresponding 21 cm powers. In Figure 12 and 
Table 3 we clearly see that, for an ideal noise condition (1 niK), the tightest constraints 




(Figure 7). 



4.2. Constraints on EoR Physics 
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(6 = 3.50lo!io S'lid Xe = 0.84^q;o7) are obtained using the powers restored with our three- 
narrow- segment separation approach, when compared with the input model values {b = 3.50 
and Xe = 0.84; Santos et al. 2003). When using the single-wide-segment approach, the 
obtained b = 3.50;^q°2o and Xg = 0.83lQ;g5 are also unbiased but their errors are larger by 
about one order of magnitude compared with three-narrow-segment approach. When using 
the single-narrow-segment approach both of the constraints (2.85 < b < 14 and 0.14 < 
Xe < 0.98) are very poor. For a realistic noise condition (60 mK), the three-narrow-segment 
separation approach still works well {b = 3.55^^11 and Xe = 0.83~^q^I; Fig. 13 and Table 3) 
whereas the single- wide-segment approach introduces larger errors in the result (6 = 3.75^*^0 35 
and Xe = O.reto^^). In such a sense we conclude that the three-narrow-segment quadratic 
polynomial fitting technique is the best approach to restore the 21 cm signals. 



4.3. Detection of BAO signals at Intermediate Redshifts with Three-Segment 

Separation Approach 

We have also studied whether or not our three-segment separation technique can be 
applied to intermediate frequency bands to detect the characteristic BAO signals (Ansari et 
al. 2012), which can be used as a sensitive probe to investigate the nature of dark matter 
and to constrain cosmological models (e.g., Bharadwaj et al. 2009 for a review; Visbal et 
al. 2009; Bagla et al. 2010). We apply the single-narrow- segment, single-wide-segment and 
three-narrow-segment approaches to the redshift range of 0.35 — 0.65, which corresponds to 
861 MHz— 1052 MHz in detection. The power spectrum of the brightness temperature of the 
21 cm signals is assumed to follow that of the matter density field 

AD,21cm(^, Z) = T|;^^„^(2;)6^(2)P3D,matter(^, z) (7) 

(Seo et al. 2010), where T2icm{z) is the brightness temperature of the redshifted 21 cm signals 
averaged at z, and b{z) is the mean bias. According to Barkana & Loeb (2007), Pritchard & 
Loeb (2008), and Chang et al. (2008), b{z) = 1 after the cosmic reionization, and T2icm(-2) 
is estimated to be 

- 188xni{z)nn,oh{l + zY 

a/12m(1 + zy + l- ilu 
where Xm{z:) is the neutral hydrogen fraction at z, and Qnfi is the ratio of the total hydrogen 
mass density to the critical density at z = 0. In literature (e.g., Zwaan et al. 2005; Prochaska 
et al. 2005; Rao et al. 2006) a conservative estimation XHi(-z)f^H,o = 0.00037 is often assumed 
for the neutral hydrogen fraction at intermediate redshifts. 

As a simple test, we carry out both single- segment and three-segment separations 
(Azgap = 0.15, 0.20, 0.25 or Ai^gap = 100, 150, 200 MHz) at zq = 0.5 with a channel width 
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du = 0.1 MHz, a channel number = 500 (i.e., a segment width Az/gcgmcnt = Ndu = 50 
MHz), and o"noisc|2=o.5=20 mK. As shown in Figure 14 and Table 6, the three-segment separa- 
tion approach (e.g., ffit = 0.016±0.004 for ^segment = {0.5, 0.7, 0.3}) yields better results than 
the single-segment separation approach (rgt = 0.036 ± 0.015). The power loss in the recon- 
struction for the > 30h~^ Mpc scale structures is wioss|fc<o.2VMpc = 1 — 'U^|fc<o.2h/Mpc < 0.02 
with three-segment separation approach while wioss|fc<o.2fc/Mpc = 0.19 ± 0.16 with single- 
segment separation approach. Therefore we conclude that the three-segment separation 
approach can successfully restore the 21 cm signals from intermediate redshifts better than 
the single-segment approach, showing its potential to be employed in detecting the BAO 
signals that typically possess characteristic ~ 150 comoving Mpc scales. More details on the 
simulation of the BAO signals and its separation from the foreground will be presented in 
the next paper since it is beyond the scope of this work. 

5. SUMMARY 

In this work we construct a complex foreground model and compare between different 
separation approaches by carrying out statistical analysis of the restored 21 cm spectra 
and corresponding power spectra, as well as their constraints on mean halo bias b and 
average ionization fraction Xg. At z = 8 the best restoration of 21 cm signals and the 
tightest determination of b and can be obtained with the three-narrow-segment quadratic 
polynomial fitting technique proposed in this work. We illustrate that the new technique 
also works well at redshifts higher than 8, and is a potentially useful tool to probe the BAO 
at intermediate redshifts. 
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Ministry of Science and Technology of the People's Republic of China (973 Program; Grant 
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Nos. 10878001, 10973010, 11125313, and 11203041), the Shanghai Science and Technology 
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Table 1: Foreground Emission Components at 150 MHz (spatial resolution = 12 arcmin^). 





Component 


Mean Value 
(K) 


rms 
(K) 




Galaxy synchrotron emission 


237 


24 




Galaxy free-free emission 


0.877 


0.205 




Galaxy clusters 


0.084 


2.06 




Extragalactic discrete sources 


88 


7937 


Extra; 


galactic discrete sources (under S'cut) 


23 


29 




Total foreground 


326 


7937 


Total fore^ 


jround (with discrete sources under S'cut) 


261 


38 



Table 2: Parameters of our synthetic observations. 



Parameter 


Value 


Field of view 


10° X 10° 


Total effective area nA^s 


17658 m^ 


Number of baselines n{n — l)/2 


(81 X 80)/2 


Frequency coverage 


50 - 200 MHz 


Frequency resolution dv 


40 kHz 


Image spatial resolution il 


1 X 10"^ sr 


Observing time r 


1 yr 


(Tnoisc at 157.8 MHz 


60 mK 
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Table 3: Comparison between different separation approaches - cases at zq = 8 with 
o"noise 1 2=8=60 mK and ImK, and mean halo bias and average ionization fraction calculated 
from relative 21 cm power restoration"^. 





Relative Power Restoration 

w\k<lh/Mpc U!\k<5h/Mpc 


Mean Halo Bias Average Ionization Fraction 

b Xe 


Input model 


1.00 1.00 


3.50 0.84 


f^iioise \z—8 

=60 mK 


Single-narrow-segment 

(Azysogmcnt = 2 MHz) 


0.25 ±0.04 0.66 ±0.02 




Single-wide-segmcnt 

(Az/scgmo„t = 2 MHz) 


0.97 ±0.05 1.01 ±0.02 


o -- + 1.65 n r,f.+0.l2 
"^■'^-0.35 '-'•'O-0.26 


Three-narrow-segment 

(Al/scgmcnt = 2 MHz) 


1.00 ±0.05 1.01 ±0.02 


o cc+0.75 n OO-I-0.07 
"J-^^-O.lS ^•°'J-0.19 


(^noise 1 2:— 8 

=1 mK 


Single- narrow-segment 

(Ar/segment = 2 MHz) 


0.64 ±0.24 0.83 ±0.13 


[2.85,14] [0.14,0.98] 


Single- wide-segment 

(Al^sogmcnt = 20 MHz) 


0.98 ± 0.12 0.99 ±0.05 


o rn+10.5 n aQ-i-0.13 

O.OU_Q 20 ^•°>'-0.65 


Tliree-narrow-segment 

(Al^segmont = 2 MHz) 


1.00 ± 0.02 1.00 ±0.02 


3.5oig:?g ^.Mtr^ 



Note. " Calculations are carried out using the fiducial reionization model of Santos et al. (2003, 2005). 
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Table 4: Comparison between different separation approaclies - cases at 2:0 = 8 witli 

anoise|.=8 = l HlK (Fig. 7; §4.1)^ 





Gap Width 


' nt 


'"lfc<l/i/Mpc 


sin cIp-ti ?iT'rnw-t;pp'TnPTit, 

kyi-X± tLJ-v-- ±LCAfl. L KJ VV O t-^l.Xlv-'XXlJ 

l^^^scgmcnt — ^ iVinZj 






0.153 ±0.070 


0.64 ±0.24 


SiiTi o'1p-'\s7'i np-'^pfyrnPTil" 
(Az^segmont = 20 MHz) 






0.055 ±0.034 


0.98 ±0.12 






-1.5 


0.116 ±0.083 


1.09 ±0.51 






-1.0 


0.099 ±0.072 


1.03 ±0.35 






-0.5 


0.082 ±0.054 


0.97 ±0.23 






0.5 


0.062 ±0.037 


0.92 ±0.16 






1.0 


0.056 ±0.034 


0.91 ±0.14 


Two- narrow-segment 




1.5 


0.050 ±0.028 


0.92 ±0.13 


(Al/,cgmcnt 2 MHz) 




-30 


0.048 ±0.026 


0.91 ±0.12 






-20 


0.052 ±0.030 


0.91 ±0.13 




Afgap 


-10 


0.062 ±0.037 


0.91 ±0.16 




(MHz) 


10 


0.084 ±0.059 


0.96 ±0.22 






20 


0.098 ±0.069 


1.02 ±0.31 






30 


0.116 ±0.086 


1.13 ±0.84 






0.5 


0.028 ±0.010 


1.00 ±0.02 






1.0 


0.024 ±0.006 


1.00 ±0.02 


Three- narrow-segment 




1.5 


0.023 ±0.005 


1.00 ±0.02 


(A:/sogmcnt = 2 MHz) 


Ai/gap 

(MHz) 


10 


0.027 ±0.009 


1.00 ±0.02 




20 


0.023 ±0.006 


1.00 ±0.02 




30 


0.022 ±0.005 


1.00 ±0.02 



Note. ° Wc run 1000 rounds of simulation and separation for each separation approach, fat is the mean 
relative root-mean-square error for the restoration of the 21 em signals, and u'|/c<ih/Mpc is the mean power 
restoration on > 6h~^ Mpc scales. Errors indicate the 68% confidence limits. 
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Table 5: Comparison between different separation approaches - cases for zq = 9, 10, 12, 14 
with anoisc(i^) = (Z//157.8 MHz)-^ mK (§4.1)'^. 



«0 


Parameter 


Single-Narrow-Segment 


Single- Wide-Segment 


Three-Narrow-Segment 


Three-Narrow-Segment 






(Ai/scgmont = 2 MHz) 


(Al/sogmcnt = 20 MHz) 


(AZgap = 1) 


(Al/gap = 10 MHz) 


9 


ffit 


0.145 ±0.067 


0.053 ±0.028 


0.032 ±0.007 


0.033 ± 0.009 




"'lfe<lh/Mpc 


0.66 ±0.23 


0.99 ±0.10 


1.00 ±0.03 


1.00 ±0.03 


10 


fRt 


0.141 ±0.062 


0.058 ±0.025 


0.042 ±0.010 


0.043 ± 0.009 


"'lfc<lh/Mpc 


0.67 ±0.23 


0.99 ±0.11 


1.01 ±0.05 


1.01 ±0.05 


12 


ffit 


0.146 ±0.055 


0.075 ± 0.022 


0.067 ±0.014 


0.066 ± 0.013 


'^lfc<l?i/Mpc 


0.73 ±0.21 


1.01 ±0.11 


1.01 ±0.08 


1.01 ±0.07 


14 


ffit 


0.157 ±0.050 


0.099 ± 0.024 


0.094 ±0.020 


0.093 ± 0.020 


'''lfe<lh/Mpc 


0.75 ±0.22 


1.02 ±0.13 


1.02 ±0.11 


1.02 ±0.11 



Note. " Parameters are defined in the same way as in Table 4. 



Table 6: Comparison between different separation approaches - cases for zq = 0.5 (Fig. 14(a); 
§4.3)". 





Gap Width 




'w\k<0.2h/Mpc 


Single-sej 

(^^segment — 


rment 
50 MHz) 






0.036 ±0.015 


0.81 ±0.16 








0.15 


0.016 ±0.004 


1.00 ±0.02 








0.20 


0.016 ±0.004 


1.00 ±0.02 


Three-se^ 


;ment 




0.25 


0.016 ±0.003 


1.00 ±0.02 


(^^scgmcnt — 


50 MHz) 


Al/gap 

(MHz) 


100 
150 


0.016 ±0.004 
0.015 ±0.003 


1.00 ±0.02 
1.00 ±0.02 






200 


0.015 ±0.003 


1.00 ±0.02 



Note. ° Parameters are defined in the same way as in Table 4, whereas the wave number range is confined 
as fc < 0.2h Mpc^^ to calculate relative power restoration. 
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b I ' ' ' ' ' ' ' ' -^-^ 

^ 1 10 

k (h/Mpc) 

Fig. 1. — ID power spectra of 21 cm signals calculated at some typical redshifts by adopting 
the fiducial reionization model (§2.1; Santos et al. 2003, 2005). 
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Fig. 2. — Telescope arrangement of the 21CMA array. Each telescope is an assembly of 127 
logarithmic periodic antennas. 
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I , 1 , 1 , 1 , 1 " o 

-2000 2000 

Fig. 3. — Density of visibility measurements in the 21CMA uv plane at 150 MHz with an 
integration time of 24 hours (§2.2.2). 
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Fig. 4. — Difference between restored (TJ-^j.^) and input (Tfore) foregrounds calculated 
with single-segment separation approach by using different polynomial approximations 
(§3.1). The calculations are repeated for 1000 times for each given Ai^scgmcnt = 2z MHz 
(z = 1, 2, ■ ■ ■ , 40) to determine the mean value of \T^orc ~ ^forcl- 
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(a) 



x;=;,i96 

1-narrow-segment separation 




t t 



=1.365 
1 -wide-segment separation 



t 



(c) 



=1.137 

3-narrow-segment separation 



0.5 1 2 

k (h/Mpc) 



0.5 1 2 

k (h/Mpc) 



0.5 1 2 5 

k (h/Mpc) 



x;=125^ 
1 -narrow 



segment separation 



-t-T-r^-h--; 




=1.389 
1 -wide-segment separation 



' ■•-■J-,-r-5-i 



1 1 



-- o 



V =0.783 

3-narrow-segment separation 



1--T-!-H,-!-M. 



0.5 1 

k (h/Mpc) 



0.5 1 

k (h/Mpc) 



0.5 1 2 

k (h/Mpc) 



Fig. 5. — Mean restored power spectra for the 21 cm signals from 10000 sky pixels with 
Cnoisc 1 2=8=60 mK, as compared with the theoretical 21 cm power spectrum ai Zq = 8 (black). 
(a)-(c) are the condition with du = 40 kHz, and (d)-(f) with du = 100 kHz. Residuals 
obtained in the test between the restored and theoretical 21 cm power spectra in each 
case are also plotted in the lower panels. 
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£ ^ line of sight 




148.8 



152.8 



156.8 
V (IVIHz) 



160.8 



164.8 



Fig. 6. — A comparison between three separation techniques discussed in §3 and §4. Upper 
paneh Sketch diagram for the spatial distribution of ionized bubbles along line of sight at 
Zo = 8. Lower panel: Corresponding spectrum of the simulated 21 cm signals (black solid), 
along with the best-fits (red solid: single-narrow-segment; green solid: single- wide- segment; 
blue star: three- narrow- segment) of T^^^.^ — Tforc, which presents the Mpc-scale components 
of 21 cm signals that have been entangled with the foreground. Clearly the best-fit spectrum 
of single-narrow-segment is significantly biased by the Mpc-scale components of the bubble, 
and best-fit spectrum of single-wide-segment is biased by the tens of Mpc-scale components 
of the giant bubbles. The black dotted line marks T^^^^ — Tf^re = 0, which is the most ideal 
result for the restoration of the 21 cm signals. 
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Fig. 7. — (a) Fractional distributions of the relative root-mean-square error r^t for single- 
narrow-segment (black solid), single- wide-segment (red square), two-narrow-segment with z 
gap (magenta solid; upper left to lower right are l^z^^^ = 1.5, 1.0, ■ ■ ■ , — 1.5, respectively), 
two-narrow-segment with u gap (blue dash; upper left to lower right are Ai/gap = —30, — 
20, 30 MHz, respectively), three-narrow-segment with z gap (red solid; upper left to 

lower right is A^gap = 1.5, 1.0, 0.5, respectively), and three-narrow-segment with u gap 
(green dash; upper left to lower right are Az/gap = 30, 20, 10 MHz, respectively) separation 
approaches, each of which is calculated based on 1000 rounds of simulation and separation 
with criioiseU=8=l (Table 4; §4.1). (b) Average power spectra of input and restored 21 cm 
signals and their 68% dispersion limits. The restorations are performed with single-narrow- 
segment (blue) and three-narrow-segment (magenta) quadratic polynomial fitting techniques 
based on 1000 rounds of simulation and separation for each. 
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Fig. 8. — One case for successful separation with the single-narrow-segment quadratic poly- 
nomial fitting technique, for which the segment width is chosen to be 2 MHz, cXnoise 1^=8=1 
mK, and our complex foreground model is applied (§4.1). (a) Spectra of input 21 cm sig- 
nals T2icm, restored 21 cm signals plus noise TlJ'^, noise Tnoisc, and the difference between 
^2icm ^^'^ ^2icm are shown in black, blue, green and red, respectively, (b) Corresponding 
power spectra for (a). Power spectrum of the complex foreground (cyan) is also plotted for 
comparison. 
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Fig. 9. — Same as Figure 8 but for one case of failed separation at 2o = 8 with the single- 
narrow-segment approach (§4.1). Spectra are marked in the same colors as in Figure 8. 
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Fig. 10. — Relative root-mean-square error rgt for the single-narrow-segment separation at 
Zo = 8 and the power of Mpc-scale components of the simulated 21 cm signals (§4.1). When 
the fits of single-narrow-segment separation is not acceptable (rgt > 0.1), there exist a clear 
correlation between r^t and P2icm|fc<ih/Mpc, i-e., logP2icm|fc<i/i/Mpc = 0.89 log rgt - 1.95 (solid) 
as can be derived with maximum likelihood method. 
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Fig. 11. — Relative power restorations w for (a) > Qh^^ Mpc and (b) > Ih^^ Mpc scales, 
each of which is obtained at 2;o = 8 ((Tnoisc|z=8=l niK) with different separation approaches 
and averaged over 1000 times (Table 3), and their dependence on mean halo bias h and 
average ionization fraction Xg. Dashed lines show the 68% errors (§4.2). 
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Fig. 12. — Xe and b determined by overlapping the wlk^ih/Upc and |fc<5/i/Mpc contours shown 
in Figures 11(a) and (b) on the Xg—b map. The input model parameters (xe = 0.84, b = 3.50) 
are shown as a black diamond, meanwhile the best restored parameters are shown with a 
red cross. Allowed parameter ranges (68% level) are marked in yellow (Table 3). 




Fig. 13. — Same as Figure 12 but for o"noise 1^=8=60 mK. 
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